Language Identification by Using Syllable-Based Duration Classification on Code-Switching Speech
نویسندگان
چکیده
Many approaches to automatic spoken language identification (LID) on monolingual speech are successfully, but LID on the code-switching speech identifying at least 2 languages from one acoustic utterance challenges these approaches. In [6], we have successfully used one-pass approach to recognize the Chinese character on the Mandarin-Taiwanese code-switching speech. In this paper, we introduce a classification method (named syllable-based duration classification) based on three clues: recognized common tonal syllable tonal syllable, the corresponding duration and speech signal to identify specific language from code-switching speech. Experimental results show that the performance of the proposed LID approach on code-switching speech exhibits closely to that of parallel tonal syllable recognition LID system on monolingual speech.
منابع مشابه
Language identification of code Switching sentences and multilingual sentences of under-resourced languages by using multi structural word information
Language identification (LID) is a process to identify the languages used in a text or speech. Code switching is the switching of a language in a sentence or speech utterance. This paper focuses on LID of words in code switching sentences. Code switching can occur intersentential or intrasentential. The reasons why a writer switches from one language to another due to various reasons and among ...
متن کاملFunctions of Code-Switching Strategies among Iranian EFL Learners and Their Speaking Ability Improvement through Code-Switching
This study investigated the impact of code-switching on speaking ability of Iranian low proficiency EFL learners. Moreover, it was an attempt to show what functions existed behind code-switching strategies used by the EFL learners. To this end, 60 male and female Iranian EFL learners age-ranged between 20 and 30 participated in the study. Data collection instruments which were used were the Int...
متن کاملLanguage identification on code-switching utterances using multiple cues
Code-switching speech is an utterance containing two or more languages. Usually, the switching linguistic unit is in clause or word levels. In this paper, a two-stage framework is proposed, containing a language identifier and then a speech recognizer, to evaluate on a Mandarin-Taiwanese codeswitching utterance. In the language identifier, we use multiple cues including acoustic, prosodic and p...
متن کاملLanguage identification of code switching Malay-English words using syllable structure information
This paper introduces a language identification approach using syllable structure information. We also review and compare other approaches. Most of these approaches use linguistic information for language identification. The information used for language identification is Malay affixation information, English vocabulary list, alphabet ngram, grapheme n-gram. The approach using syllable structur...
متن کاملCode-Switching event detection based on delta-BIC using phonetic eigenvoice models
This paper presents a new paradigm for code-switching event detection based on delta Bayesian Information Criterion (∆BIC). First, an automatic speech recognizer (ASR) and an articulatory feature (AF) detector are constructed. The intersyllable boundaries obtained from the ASR are regarded as the potential code-switching boundaries. To estimate the language likelihood, eigenvoice models (EVMs) ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006